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Abstract 

It is common to use a compartmental, fluid model described by a system of ordinary differ¬ 
ential equations (ODEs) to model disease spread. In addition to their simplicity, these models 
are also the mean-field approximations of more accurate stochastic models of disease spread on 
contact networks. For the simplest case of a stochastic susceptible-infected-susceptible (SIS) 
process (infection with recovery) on a complete network, it has been shown that the fraction of 
infected nodes converges to the mean-field ODE as the number of nodes increases. However the 
proofs are not simple, requiring sophisticated probability, partial differential equations (PDE), 
or infinite systems of ODEs. We provide a short proof in this case for convergence in mean- 
square on finite time-intervals using a system of two ODEs and a moment inequality and also 
obtain the first lower bound on the expected fraction of infected nodes. 


1 Introduction 

Interactions among individuals impact the transmission of infectious diseases, and the structure of 
the contact network imposes an important constraint on the transmission dynamics (Keeling and Earnes 
2005). Models used in epidemiology to study disease transmission include mean-field models using 
ordinary differential equation (ODE) (Anderson and May 1991; Earnes and Keeling 2002), Markov 
or state-based models (Hailey 1975), and individual-based simulation models (Goodreau et al. 2012; 
Keeling and Earnes 2005). These capture the contact network structure at different complexity lev¬ 
els and differ in analytical tractability, computational performance, and accuracy (Keeling and Grenfell 
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2000; Keeling and Eames 2005; Bansal et al. 2007). When choosing a particular type of model, 
whether mean-field models or individual-based simulations, the researcher should not only take 
into account the disease being transmitted, the size of the target population, the availability of 
data about the underlying network structure, the desired level of analytical tractability, and com¬ 
putational performance, but also the accuracy of the chosen model (Bansal et al. 2007; Keeling 
2005; Keeling and Eames 2005). 

The vast majority of studies, which assessed the accuracy of mean-field models, only relied on 
numerical experiments (Simon and Kiss 2012). However, the rigorous analysis of the accuracy of 
mean-field models requires deriving a mathematical link between mean-field approximations and 
the exact Markov models. Kurtz (1970, 1971) are the first to show this (see also Ethier and Kurtz 
(2005)). They use operator semigroup and martingale techniques to prove that the exact models 
converge in probability over finite time-intervals to the solution of the mean-field ODE, using as 
a specific example the stochastic SIS-process on a complete network with increasing number of 
nodes. Simon and Kiss (2012) summarize the proof in Ethier and Kurtz (2005) without going into 
the technical details. 

Simon et al. (2011) showed for the same setting as ours (i.e., a stochastic SIS-process on a 
complete network) based on the idea outlined in Diekmann and Heesterbeek (2000), that with 
increasing number of nodes the expected infected fraction converges to the solution of the mean- 
field ODE model using a partial differential equation (PDE) approach. This approach exploits the 
fact, that for a large number of nodes n, the discrete probability distribution Xk(t) of the Markov 
model (i.e., x^it) denotes the probability of having k infected nodes) can be approximated using a 
continuous density function p(t, k/n). The convergence is then proved by showing that the expected 
infected fraction of the resulting first-order PDE converges both to the solution of the mean-field 
model and to the expected fraction of the Markov model for large n. A more detailed description 
of this proof is given in Chapter 2 in Taylor (2012). 

Motivated by the first-order PDE approach and the stochastic approach, which both draw from 
multiple different mathematical areas, Simon and Kiss (2012) introduced an ODE-based approach 
to show that the expected infected fraction of the Markov model converges uniformly on finite time- 
intervals to the solution of the mean-field ODE model. Establishing an infinite homogeneous linear 
system of ODEs to approximate all moments of the probability distribution Xk(t), Simon and Kiss 
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(2012) show using a perturbation theorem for the infinite system of ODEs that the mean-field 
solution is an upper bound of the expected infected fraction in the Markov model. Further, they 
provide a bound dependent on the network size n for the absolute difference in the expected infected 
fraction between the mean-held model and the Markov model. Note that neither the PDE nor the 
infinite ODE approach show that the distribution converges to the mean-held model. 

In this paper, we provide a short proof showing that as the population size increases, the dynam¬ 
ics of the infected fraction of the stochastic SIS-process on a complete graph converges uniformly in 
mean-square on finite time-intervals to the mean-held ODE model. We also provide the hrst lower 
bound for the expected fraction of infected nodes. Note that convergence in mean-square implies 
convergence in probability and hence in distribution, thus providing a stronger result than previous 
proofs of Kurtz (1970, 1971), Ethier and Kurtz (2005), Simon et al. (2011), and Simon and Kiss 
(2012). More importantly, we only use a system of two ODEs, basic ODE techniques, and Jensen’s 
inequality in our proof. We hope our short proof using only elementary tools opens up the held 
to more applied researchers that might not be comfortable using martingale or PDE theory, pro¬ 
vides the basis for further mean-held convergence results, and increases understanding about the 
performance of mean-held models. 

2 Mean-square convergence result 

Consider a complete graph of n nodes and let X n (t) be a Markov process describing which nodes in 
the network are infected at time t. An infected node infects each neighbor at a rate r/n and recovers 
at a rate 7 . We let I n (t) := num/(X n (i)) be the number of infected nodes and i n (t) := I n (t)/n the 
infected fraction. We may sometimes drop the dependence on t and n. 

Our goal is to prove the following theorem. It shows that the infected fraction converges in 
mean-square on hnite time-intervals to the mean-held approximation as the population increases, 
and it gives upper and lower bounds on the expectation of the infected fraction. 

Theorem 1 . If i n ( 0) = u for all n where u G [0,1], then as n — >■ 00 , i n (t) converges uniformly in 
mean-square, i.e., 

E[\i n (t) - y{t)\ 2 \ -> 0 


3 


on any time finite interval [0, T] to the solution of the mean-field equations: 



y' = ry{ 1 - y) - 7 V, 2 /( 0 ) = u. 

(1) 

In addition we 

: have the following bounds for t > 0, 



y(t) > E[i n (t)\ > zi >n (t), 

(2a) 


Z2,n{ f ) > E[i n (t) 2 } > Z hn (t) 2 , 

(2b) 

where 

Z 2 ,n(t)) solves the initial value problem: 



A,n = T {zi,n - 22,ra) ~ 7^1, n, 

(3a) 


4,n = 2 r (z2,n ~ 4n) ~ 27^2,n + (1 /w)(t + 7 ), 

(3b) 


2l,n(0) = U, Z 2 ,n{0) = U 2 . 

(3c) 


Of particular interest is the lower bound on E[i n ] in (2a). All the bounds in (2) are new, except 
for y(t) > E[i n (t)\, which can also be found for example in the proof of Theorem 3.1 of Ganesh et al. 
(2005) or Proposition 5.3 of Simon and Kiss (2012), but even there, our approach is new. 

3 Proof of Theorem 1 

Since the theorem holds trivially for i n ( 0) = u = 0, we now assume u > 0. To deal with an¬ 
other technicality, the initial value problems (1) and (3) have unique solutions because their right 
hand sides are smooth (i.e., continuously differentiable). Since y is deterministic, we will show 
mean-square convergence by proving that E[i n (t)\ —> y(t ) and E[i n (t) 2 ] —»• y(t) 2 (or equivalently 
Var[i n (t)] —>• 0), uniformly on [0, T]. We show these limits by proving the bounds (2) and that 
Zi ,n{t) -t y(t) and z 2 ,n(t) ->■ y[t) 2 . 

Let SI(t ) := num 5 /(T(l)) be the number of edges in the network with both a susceptible 
and infected endpoint. We start with the following standard proposition, which we prove in the 
appendix. 
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Proposition 2. E[I]' = (r /n)E[SI] — 7 E[I] and 


E[I 2 ]' = ( T/n)E[SI((I + l ) 2 - I 2 )] + 7 E[/((/ - l ) 2 - I 2 )}. 

In the case of a complete graph, SI = I(n — I). Substituting into Proposition 2 and using 
i = I/n, we have 

E[i]' = r{E[i] - E[i 2 ]) - jE\i], (4a) 

E[i 2 ]' = 2r(E[i 2 ] - E[i 3 ]) - 2 7 E[i 2 ] + (1 /n){r{E[i] - E[i 2 ]) + ' yE[i ]). (4b) 

Applying Jensen’s inequality, E[z ] 2 < E[i 2 ], to (4a), 

E[i]’ < tE[{\( 1 - E[z]) - jE[i\. (5) 

Then a standard ODE comparison theorem, Lemma 3, justifies our intuition that the solution 
to the mean-field equations (1) is an upper bound: E[i{t)\ < y(t) for t > 0. 

Lemma 3. Suppose that a(t ) and b(t ) are scalar; f is smooth; a(0) < 6(0); and a'{t) < f(a(t)) and 
b'(t ) = f(b(t )) for 0 < t < T. Then a(t) < b(t) for 0 <t <T. 

Proof. See any graduate ODE text such as Theorem 6.1 in ale (2009). □ 

To obtain a lower bound we apply Jensen’s inequality, E[i 2 ] l ' b < E[i 3 ] and E[i} 2 < E[i 2 ], to 
(4b), 

E[i 2 }' < 2r(E[z 2 ] - Eli 2 ] 1 ' 5 ) - 2 7 E[i 2 ] + (l/n)(r£[i](l - E[i\) + 7 £[i]). ( 6 ) 

Using the fact that E[i\ and 1 — E[i] are in [0,1], we obtain 

Eli 2 ]' < 2r(Eli 2 ] - E^ 2 ] 1 ' 5 ) - 2 7 E[z 2 ] + (l/n)(r + 7 ). (7) 

Applying Lemma 3, we have E[i n (t) 2 ] < Z 2 , n {t) fc> r t > 0, where Z 2 , n (t) solves the initial value 
problem: 

4 ,n = 2 r(z 2 ,n - zl' 5 ) - 2-fz 2 ,n + (l/n)(r + 7 ), z 2 , n ( 0 ) = u 2 . ( 8 ) 
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Now going back to (4a), we have a lower bound for E[i]' 


E[, i]' > r(E\i] - z 2 , n ) - 7 £[i] (9) 

Applying Lemma 3 here by treating z 2 , n (t) as an exogenous function, we have E[i n (t)] > z\ n (t) for 
t> 0 , where z\ tn {t) solves the initial value problem: 

A,n = t(Zi ,n - z 2 ,n) ~ 7^1, n, U,n(0) = U. (10) 

Our last lower bound E[i%] > E[i n ] 2 > £ 2 n then follows from Jensen’s inequality. 

Having proven the bounds, the next step is to show that zi iTl (t) —> y(t ) and z 2jH (t) — > y(t ) 2 
uniformly on [0,T]. Note that z = (y,y 2 ) is a solution to the system, 

A =t(z 1 - z 2 ) -jzi, (11a) 

A = 2 t (^ 2 - z 2 - 5 ) - 2 7 2 2 , (lib) 

= u. z 2 = n 2 , (He) 

because ( 11 a) and ( 11 c) become ( 1 ) and the right hand side of (lib) ends up being 2 yy', which 
equals ( y 2 )' as we require. Uniqueness holds because the right hand side is smooth. Substituting 
e = 1/n, the right hand side of (3) is also smooth in e and it converges uniformly to the right 
hand side of (11) as e —> 0 (i.e., n —> oo). Then by the results about the continuous dependence 
on parameters, the ODE solutions converge, (zi tn ,z 2tn ) —> ( y,y 2 ) uniformly on [0, T], proving the 
claim. 

4 Conclusion 

We gave a short proof showing that as the population size increases, the dynamics of the infected 
fraction of the stochastic SIS-process on a complete graph converges uniformly in mean-square on 
finite time-intervals to the mean-field ODE model. Using the ODE from Proposition 2 to describe 
the dynamics of the first moment (the expected number of nodes infected) and Jensen’s inequality 
to approximate the second moment in this equation, we showed that the solution of the mean-field 
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ODE model provides an upper bound to the expected infected fraction. Further, we established 
the first lower bound on the expected fraction infected. The lower bound is from a system of two 
ODEs: the ODE describing the dynamics of the first moment combined with an ODE describing 
the dynamics of the second moment, which we bounded using Jensen’s inequality. Finally, uniform 
convergence in mean-square was shown by proving that the lower bound (i.e., the solution of the 
system of two ODEs approximating the dynamics of the first two moments) converges to the infected 
fraction and its square in the mean-field ODE model. Thus, the approach presented provides both 
a stronger convergence result compared to previous approaches showing convergence in probability 
or convergence of the expected value as well as a shorter, more elementary proof. 

Mean-field models appear in many fields, not just epidemic modeling, and if such an elementary 
approach to proving convergence is generalizable, it would have many users. However, one limitation 
of our approach is that Lemma 3 about ODE inequalities only holds for scalar ODEs and the 
argument we used relied on the second equation not depending on the first. In cases where a 
mean-field model leads to a bound involving a system of more and less simple ODEs, some other 
argument may be needed. Developing such arguments to generalize this elementary approach 
beyond the SIS-model to more complicated epidemic models is an area for future work. 

Acknowledgements. We thank Peter L. Simon, Istvan Z. Kiss, and two anonymous reviewers 
for helpful comments. 
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Appendix 


Our proof of Proposition 2 is shorter and more elementary than the original proof by Rand (1999), 
but see also Simon et al. (2011) for a more formulaic proof. 

Proposition 2. E[I]' = (t/ n)E[SI] — 7 E[I] and 

E[I 2 }' = (r/n)E[SI((I + l ) 2 - I 2 )} + jE[I((I - l ) 2 - I 2 )]. 

Proof. In network configuration x, the system can either transition to states x + with num/(x + ) = 
num/(i) + 1 or to states x~ with num/(i“) = numj(i) — 1 (i.e., only to states with ±1 infected 
nodes). The aggregate rate for the first is 7 num/(x) and (r/n)num 5 /(a:) for the second, proving 
that 

lim (E[I(h)\I(0) = x\ — num j(x))/h = (t/Vi) num^Ra;) — 7 num/(x). (12) 

h-t 0 

The transition rates and times of /(f) 2 are the same as those of /(f), just now the jump sizes are 
(/ ± l) 2 — I 2 instead of ±1: 

lim(Z?[/(/i) 2 |/(0) =x\ — nuni/(r) 2 )//i 

= (r/n) nuni 5 /(x)((a; + l) 2 — x 2 ) + 7 num/(x)((x — l) 2 — x 2 ). (13) 

Multiplying both sides of (12) and (13) by P[X( 0) = x\ and summing over x (unproblematic since 
it is a finite sum) proves the claims. □ 
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